Overview
Brought to you by YData
Dataset statistics
| Number of variables | 18 |
|---|---|
| Number of observations | 36275 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 3138 |
| Duplicate rows (%) | 8.7% |
| Total size in memory | 5.0 MiB |
| Average record size in memory | 144.0 B |
Variable types
| Categorical | 8 |
|---|---|
| Numeric | 10 |
| Dataset has 3138 (8.7%) duplicate rows | Duplicates |
no_of_previous_bookings_not_canceled is highly overall correlated with repeated_guest | High correlation |
repeated_guest is highly overall correlated with no_of_previous_bookings_not_canceled | High correlation |
no_of_adults is highly imbalanced (52.4%) | Imbalance |
required_car_parking_space is highly imbalanced (80.1%) | Imbalance |
room_type_reserved is highly imbalanced (62.5%) | Imbalance |
repeated_guest is highly imbalanced (82.8%) | Imbalance |
no_of_previous_cancellations is highly skewed (γ1 = 25.19987595) | Skewed |
no_of_children has 33577 (92.6%) zeros | Zeros |
no_of_weekend_nights has 16872 (46.5%) zeros | Zeros |
no_of_week_nights has 2387 (6.6%) zeros | Zeros |
lead_time has 1297 (3.6%) zeros | Zeros |
no_of_previous_cancellations has 35937 (99.1%) zeros | Zeros |
no_of_previous_bookings_not_canceled has 35463 (97.8%) zeros | Zeros |
avg_price_per_room has 545 (1.5%) zeros | Zeros |
no_of_special_requests has 19777 (54.5%) zeros | Zeros |
Reproduction
| Analysis started | 2025-03-20 05:34:26.060543 |
|---|---|
| Analysis finished | 2025-03-20 05:34:31.944574 |
| Duration | 5.88 seconds |
| Software version | ydata-profiling vv4.14.0 |
| Download configuration | config.json |
Variables
no_of_adults
Categorical
Imbalance 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 283.5 KiB |
| 2 | |
|---|---|
| 1 | |
| 3 | 2317 |
| 0 | 139 |
| 4 | 16 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2 |
|---|---|
| 2nd row | 2 |
| 3rd row | 1 |
| 4th row | 2 |
| 5th row | 2 |
Common Values
| Value | Count | Frequency (%) |
| 2 | 26108 | |
| 1 | 7695 | 21.2% |
| 3 | 2317 | 6.4% |
| 0 | 139 | 0.4% |
| 4 | 16 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 2 | 26108 | |
| 1 | 7695 | 21.2% |
| 3 | 2317 | 6.4% |
| 0 | 139 | 0.4% |
| 4 | 16 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 26108 | |
| 1 | 7695 | 21.2% |
| 3 | 2317 | 6.4% |
| 0 | 139 | 0.4% |
| 4 | 16 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 36275 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 2 | 26108 | |
| 1 | 7695 | 21.2% |
| 3 | 2317 | 6.4% |
| 0 | 139 | 0.4% |
| 4 | 16 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 36275 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 2 | 26108 | |
| 1 | 7695 | 21.2% |
| 3 | 2317 | 6.4% |
| 0 | 139 | 0.4% |
| 4 | 16 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 36275 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 2 | 26108 | |
| 1 | 7695 | 21.2% |
| 3 | 2317 | 6.4% |
| 0 | 139 | 0.4% |
| 4 | 16 | < 0.1% |
no_of_children
Real number (ℝ)
Zeros 
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.10527912 |
| Minimum | 0 |
|---|---|
| Maximum | 10 |
| Zeros | 33577 |
| Zeros (%) | 92.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 283.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 10 |
| Range | 10 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.40264806 |
|---|---|
| Coefficient of variation (CV) | 3.8245767 |
| Kurtosis | 36.981856 |
| Mean | 0.10527912 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 4.7103495 |
| Sum | 3819 |
| Variance | 0.16212546 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 33577 | |
| 1 | 1618 | 4.5% |
| 2 | 1058 | 2.9% |
| 3 | 19 | 0.1% |
| 9 | 2 | < 0.1% |
| 10 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 33577 | |
| 1 | 1618 | 4.5% |
| 2 | 1058 | 2.9% |
| 3 | 19 | 0.1% |
| 9 | 2 | < 0.1% |
| 10 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 10 | 1 | < 0.1% |
| 9 | 2 | < 0.1% |
| 3 | 19 | 0.1% |
| 2 | 1058 | 2.9% |
| 1 | 1618 | 4.5% |
| 0 | 33577 |
no_of_weekend_nights
Real number (ℝ)
Zeros 
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.81072364 |
| Minimum | 0 |
|---|---|
| Maximum | 7 |
| Zeros | 16872 |
| Zeros (%) | 46.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 283.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 2 |
| Maximum | 7 |
| Range | 7 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 0.87064361 |
|---|---|
| Coefficient of variation (CV) | 1.0739092 |
| Kurtosis | 0.29885756 |
| Mean | 0.81072364 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.73761596 |
| Sum | 29409 |
| Variance | 0.7580203 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 16872 | |
| 1 | 9995 | |
| 2 | 9071 | |
| 3 | 153 | 0.4% |
| 4 | 129 | 0.4% |
| 5 | 34 | 0.1% |
| 6 | 20 | 0.1% |
| 7 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 16872 | |
| 1 | 9995 | |
| 2 | 9071 | |
| 3 | 153 | 0.4% |
| 4 | 129 | 0.4% |
| 5 | 34 | 0.1% |
| 6 | 20 | 0.1% |
| 7 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 7 | 1 | < 0.1% |
| 6 | 20 | 0.1% |
| 5 | 34 | 0.1% |
| 4 | 129 | 0.4% |
| 3 | 153 | 0.4% |
| 2 | 9071 | |
| 1 | 9995 | |
| 0 | 16872 |
no_of_week_nights
Real number (ℝ)
Zeros 
| Distinct | 18 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.2043005 |
| Minimum | 0 |
|---|---|
| Maximum | 17 |
| Zeros | 2387 |
| Zeros (%) | 6.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 283.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 2 |
| Q3 | 3 |
| 95-th percentile | 5 |
| Maximum | 17 |
| Range | 17 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.4109049 |
|---|---|
| Coefficient of variation (CV) | 0.6400692 |
| Kurtosis | 7.7982839 |
| Mean | 2.2043005 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.5993504 |
| Sum | 79961 |
| Variance | 1.9906525 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 11444 | |
| 1 | 9488 | |
| 3 | 7839 | |
| 4 | 2990 | 8.2% |
| 0 | 2387 | 6.6% |
| 5 | 1614 | 4.4% |
| 6 | 189 | 0.5% |
| 7 | 113 | 0.3% |
| 10 | 62 | 0.2% |
| 8 | 62 | 0.2% |
| Other values (8) | 87 | 0.2% |
| Value | Count | Frequency (%) |
| 0 | 2387 | 6.6% |
| 1 | 9488 | |
| 2 | 11444 | |
| 3 | 7839 | |
| 4 | 2990 | 8.2% |
| 5 | 1614 | 4.4% |
| 6 | 189 | 0.5% |
| 7 | 113 | 0.3% |
| 8 | 62 | 0.2% |
| 9 | 34 | 0.1% |
| Value | Count | Frequency (%) |
| 17 | 3 | < 0.1% |
| 16 | 2 | < 0.1% |
| 15 | 10 | < 0.1% |
| 14 | 7 | < 0.1% |
| 13 | 5 | < 0.1% |
| 12 | 9 | < 0.1% |
| 11 | 17 | < 0.1% |
| 10 | 62 | |
| 9 | 34 | |
| 8 | 62 |
type_of_meal_plan
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 283.5 KiB |
| Meal Plan 1 | |
|---|---|
| Not Selected | |
| Meal Plan 2 | |
| Meal Plan 3 | 5 |
Length
| Max length | 12 |
|---|---|
| Median length | 11 |
| Mean length | 11.14142 |
| Min length | 11 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Meal Plan 1 |
|---|---|
| 2nd row | Not Selected |
| 3rd row | Meal Plan 1 |
| 4th row | Meal Plan 1 |
| 5th row | Not Selected |
Common Values
| Value | Count | Frequency (%) |
| Meal Plan 1 | 27835 | |
| Not Selected | 5130 | 14.1% |
| Meal Plan 2 | 3305 | 9.1% |
| Meal Plan 3 | 5 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| meal | 31145 | |
| plan | 31145 | |
| 1 | 27835 | |
| not | 5130 | 4.9% |
| selected | 5130 | 4.9% |
| 2 | 3305 | 3.2% |
| 3 | 5 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| l | 67420 | |
| 67420 | ||
| a | 62290 | |
| e | 46535 | |
| M | 31145 | |
| P | 31145 | |
| n | 31145 | |
| 1 | 27835 | |
| t | 10260 | 2.5% |
| N | 5130 | 1.3% |
| Other values (6) | 23830 | 5.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 404155 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| l | 67420 | |
| 67420 | ||
| a | 62290 | |
| e | 46535 | |
| M | 31145 | |
| P | 31145 | |
| n | 31145 | |
| 1 | 27835 | |
| t | 10260 | 2.5% |
| N | 5130 | 1.3% |
| Other values (6) | 23830 | 5.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 404155 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| l | 67420 | |
| 67420 | ||
| a | 62290 | |
| e | 46535 | |
| M | 31145 | |
| P | 31145 | |
| n | 31145 | |
| 1 | 27835 | |
| t | 10260 | 2.5% |
| N | 5130 | 1.3% |
| Other values (6) | 23830 | 5.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 404155 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| l | 67420 | |
| 67420 | ||
| a | 62290 | |
| e | 46535 | |
| M | 31145 | |
| P | 31145 | |
| n | 31145 | |
| 1 | 27835 | |
| t | 10260 | 2.5% |
| N | 5130 | 1.3% |
| Other values (6) | 23830 | 5.9% |
required_car_parking_space
Categorical
Imbalance 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 283.5 KiB |
| 0 | |
|---|---|
| 1 | 1124 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 35151 | |
| 1 | 1124 | 3.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 35151 | |
| 1 | 1124 | 3.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 35151 | |
| 1 | 1124 | 3.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 36275 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 35151 | |
| 1 | 1124 | 3.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 36275 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 35151 | |
| 1 | 1124 | 3.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 36275 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 35151 | |
| 1 | 1124 | 3.1% |
room_type_reserved
Categorical
Imbalance 
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 283.5 KiB |
| Room_Type 1 | |
|---|---|
| Room_Type 4 | |
| Room_Type 6 | 966 |
| Room_Type 2 | 692 |
| Room_Type 5 | 265 |
| Other values (2) | 165 |
Length
| Max length | 11 |
|---|---|
| Median length | 11 |
| Mean length | 11 |
| Min length | 11 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Room_Type 1 |
|---|---|
| 2nd row | Room_Type 1 |
| 3rd row | Room_Type 1 |
| 4th row | Room_Type 1 |
| 5th row | Room_Type 1 |
Common Values
| Value | Count | Frequency (%) |
| Room_Type 1 | 28130 | |
| Room_Type 4 | 6057 | 16.7% |
| Room_Type 6 | 966 | 2.7% |
| Room_Type 2 | 692 | 1.9% |
| Room_Type 5 | 265 | 0.7% |
| Room_Type 7 | 158 | 0.4% |
| Room_Type 3 | 7 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| room_type | 36275 | |
| 1 | 28130 | |
| 4 | 6057 | 8.3% |
| 6 | 966 | 1.3% |
| 2 | 692 | 1.0% |
| 5 | 265 | 0.4% |
| 7 | 158 | 0.2% |
| 3 | 7 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 72550 | |
| R | 36275 | |
| m | 36275 | |
| _ | 36275 | |
| T | 36275 | |
| y | 36275 | |
| p | 36275 | |
| e | 36275 | |
| 36275 | ||
| 1 | 28130 | 7.0% |
| Other values (6) | 8145 | 2.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 399025 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| o | 72550 | |
| R | 36275 | |
| m | 36275 | |
| _ | 36275 | |
| T | 36275 | |
| y | 36275 | |
| p | 36275 | |
| e | 36275 | |
| 36275 | ||
| 1 | 28130 | 7.0% |
| Other values (6) | 8145 | 2.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 399025 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| o | 72550 | |
| R | 36275 | |
| m | 36275 | |
| _ | 36275 | |
| T | 36275 | |
| y | 36275 | |
| p | 36275 | |
| e | 36275 | |
| 36275 | ||
| 1 | 28130 | 7.0% |
| Other values (6) | 8145 | 2.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 399025 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| o | 72550 | |
| R | 36275 | |
| m | 36275 | |
| _ | 36275 | |
| T | 36275 | |
| y | 36275 | |
| p | 36275 | |
| e | 36275 | |
| 36275 | ||
| 1 | 28130 | 7.0% |
| Other values (6) | 8145 | 2.0% |
lead_time
Real number (ℝ)
Zeros 
| Distinct | 352 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 85.232557 |
| Minimum | 0 |
|---|---|
| Maximum | 443 |
| Zeros | 1297 |
| Zeros (%) | 3.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 283.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 17 |
| median | 57 |
| Q3 | 126 |
| 95-th percentile | 273 |
| Maximum | 443 |
| Range | 443 |
| Interquartile range (IQR) | 109 |
Descriptive statistics
| Standard deviation | 85.930817 |
|---|---|
| Coefficient of variation (CV) | 1.0081924 |
| Kurtosis | 1.1795941 |
| Mean | 85.232557 |
| Median Absolute Deviation (MAD) | 47 |
| Skewness | 1.2924915 |
| Sum | 3091811 |
| Variance | 7384.1053 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 1297 | 3.6% |
| 1 | 1078 | 3.0% |
| 2 | 643 | 1.8% |
| 3 | 630 | 1.7% |
| 4 | 628 | 1.7% |
| 5 | 577 | 1.6% |
| 6 | 519 | 1.4% |
| 8 | 436 | 1.2% |
| 7 | 429 | 1.2% |
| 12 | 412 | 1.1% |
| Other values (342) | 29626 |
| Value | Count | Frequency (%) |
| 0 | 1297 | |
| 1 | 1078 | |
| 2 | 643 | |
| 3 | 630 | |
| 4 | 628 | |
| 5 | 577 | |
| 6 | 519 | |
| 7 | 429 | 1.2% |
| 8 | 436 | 1.2% |
| 9 | 332 | 0.9% |
| Value | Count | Frequency (%) |
| 443 | 22 | 0.1% |
| 433 | 20 | 0.1% |
| 418 | 60 | |
| 386 | 69 | |
| 381 | 2 | < 0.1% |
| 377 | 69 | |
| 372 | 1 | < 0.1% |
| 361 | 5 | < 0.1% |
| 359 | 16 | < 0.1% |
| 355 | 1 | < 0.1% |
arrival_year
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 283.5 KiB |
| 2018 | |
|---|---|
| 2017 |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2017 |
|---|---|
| 2nd row | 2018 |
| 3rd row | 2018 |
| 4th row | 2018 |
| 5th row | 2018 |
Common Values
| Value | Count | Frequency (%) |
| 2018 | 29761 | |
| 2017 | 6514 | 18.0% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 2018 | 29761 | |
| 2017 | 6514 | 18.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 36275 | |
| 0 | 36275 | |
| 1 | 36275 | |
| 8 | 29761 | |
| 7 | 6514 | 4.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 145100 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 2 | 36275 | |
| 0 | 36275 | |
| 1 | 36275 | |
| 8 | 29761 | |
| 7 | 6514 | 4.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 145100 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 2 | 36275 | |
| 0 | 36275 | |
| 1 | 36275 | |
| 8 | 29761 | |
| 7 | 6514 | 4.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 145100 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 2 | 36275 | |
| 0 | 36275 | |
| 1 | 36275 | |
| 8 | 29761 | |
| 7 | 6514 | 4.5% |
arrival_month
Real number (ℝ)
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7.4236527 |
| Minimum | 1 |
|---|---|
| Maximum | 12 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 283.5 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 5 |
| median | 8 |
| Q3 | 10 |
| 95-th percentile | 12 |
| Maximum | 12 |
| Range | 11 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 3.0698944 |
|---|---|
| Coefficient of variation (CV) | 0.41352883 |
| Kurtosis | -0.93318896 |
| Mean | 7.4236527 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | -0.34822885 |
| Sum | 269293 |
| Variance | 9.4242517 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 10 | 5317 | |
| 9 | 4611 | |
| 8 | 3813 | |
| 6 | 3203 | |
| 12 | 3021 | |
| 11 | 2980 | |
| 7 | 2920 | |
| 4 | 2736 | |
| 5 | 2598 | |
| 3 | 2358 | |
| Other values (2) | 2718 |
| Value | Count | Frequency (%) |
| 1 | 1014 | 2.8% |
| 2 | 1704 | 4.7% |
| 3 | 2358 | |
| 4 | 2736 | |
| 5 | 2598 | |
| 6 | 3203 | |
| 7 | 2920 | |
| 8 | 3813 | |
| 9 | 4611 | |
| 10 | 5317 |
| Value | Count | Frequency (%) |
| 12 | 3021 | |
| 11 | 2980 | |
| 10 | 5317 | |
| 9 | 4611 | |
| 8 | 3813 | |
| 7 | 2920 | |
| 6 | 3203 | |
| 5 | 2598 | |
| 4 | 2736 | |
| 3 | 2358 |
arrival_date
Real number (ℝ)
| Distinct | 31 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15.596995 |
| Minimum | 1 |
|---|---|
| Maximum | 31 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 283.5 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 8 |
| median | 16 |
| Q3 | 23 |
| 95-th percentile | 29 |
| Maximum | 31 |
| Range | 30 |
| Interquartile range (IQR) | 15 |
Descriptive statistics
| Standard deviation | 8.7404474 |
|---|---|
| Coefficient of variation (CV) | 0.56039303 |
| Kurtosis | -1.157214 |
| Mean | 15.596995 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | 0.028808569 |
| Sum | 565781 |
| Variance | 76.39542 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 13 | 1358 | 3.7% |
| 17 | 1345 | 3.7% |
| 2 | 1331 | 3.7% |
| 4 | 1327 | 3.7% |
| 19 | 1327 | 3.7% |
| 16 | 1306 | 3.6% |
| 20 | 1281 | 3.5% |
| 15 | 1273 | 3.5% |
| 6 | 1273 | 3.5% |
| 18 | 1260 | 3.5% |
| Other values (21) | 23194 |
| Value | Count | Frequency (%) |
| 1 | 1133 | |
| 2 | 1331 | |
| 3 | 1098 | |
| 4 | 1327 | |
| 5 | 1154 | |
| 6 | 1273 | |
| 7 | 1110 | |
| 8 | 1198 | |
| 9 | 1130 | |
| 10 | 1089 |
| Value | Count | Frequency (%) |
| 31 | 578 | |
| 30 | 1216 | |
| 29 | 1190 | |
| 28 | 1129 | |
| 27 | 1059 | |
| 26 | 1146 | |
| 25 | 1146 | |
| 24 | 1103 | |
| 23 | 990 | |
| 22 | 1023 |
market_segment_type
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 283.5 KiB |
| Online | |
|---|---|
| Offline | |
| Corporate | 2017 |
| Complementary | 391 |
| Aviation | 125 |
Length
| Max length | 13 |
|---|---|
| Median length | 6 |
| Mean length | 6.5393797 |
| Min length | 6 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Offline |
|---|---|
| 2nd row | Online |
| 3rd row | Online |
| 4th row | Online |
| 5th row | Online |
Common Values
| Value | Count | Frequency (%) |
| Online | 23214 | |
| Offline | 10528 | |
| Corporate | 2017 | 5.6% |
| Complementary | 391 | 1.1% |
| Aviation | 125 | 0.3% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| online | 23214 | |
| offline | 10528 | |
| corporate | 2017 | 5.6% |
| complementary | 391 | 1.1% |
| aviation | 125 | 0.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 57472 | |
| e | 36541 | |
| l | 34133 | |
| i | 33992 | |
| O | 33742 | |
| f | 21056 | 8.9% |
| o | 4550 | 1.9% |
| r | 4425 | 1.9% |
| a | 2533 | 1.1% |
| t | 2533 | 1.1% |
| Other values (6) | 6239 | 2.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 237216 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| n | 57472 | |
| e | 36541 | |
| l | 34133 | |
| i | 33992 | |
| O | 33742 | |
| f | 21056 | 8.9% |
| o | 4550 | 1.9% |
| r | 4425 | 1.9% |
| a | 2533 | 1.1% |
| t | 2533 | 1.1% |
| Other values (6) | 6239 | 2.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 237216 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| n | 57472 | |
| e | 36541 | |
| l | 34133 | |
| i | 33992 | |
| O | 33742 | |
| f | 21056 | 8.9% |
| o | 4550 | 1.9% |
| r | 4425 | 1.9% |
| a | 2533 | 1.1% |
| t | 2533 | 1.1% |
| Other values (6) | 6239 | 2.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 237216 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| n | 57472 | |
| e | 36541 | |
| l | 34133 | |
| i | 33992 | |
| O | 33742 | |
| f | 21056 | 8.9% |
| o | 4550 | 1.9% |
| r | 4425 | 1.9% |
| a | 2533 | 1.1% |
| t | 2533 | 1.1% |
| Other values (6) | 6239 | 2.6% |
repeated_guest
Categorical
High correlation  Imbalance 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 283.5 KiB |
| 0 | |
|---|---|
| 1 | 930 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 35345 | |
| 1 | 930 | 2.6% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 35345 | |
| 1 | 930 | 2.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 35345 | |
| 1 | 930 | 2.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 36275 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 35345 | |
| 1 | 930 | 2.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 36275 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 35345 | |
| 1 | 930 | 2.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 36275 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 35345 | |
| 1 | 930 | 2.6% |
no_of_previous_cancellations
Real number (ℝ)
Skewed  Zeros 
| Distinct | 9 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.023349414 |
| Minimum | 0 |
|---|---|
| Maximum | 13 |
| Zeros | 35937 |
| Zeros (%) | 99.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 283.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 13 |
| Range | 13 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.36833145 |
|---|---|
| Coefficient of variation (CV) | 15.774762 |
| Kurtosis | 732.73568 |
| Mean | 0.023349414 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 25.199876 |
| Sum | 847 |
| Variance | 0.13566806 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 35937 | |
| 1 | 198 | 0.5% |
| 2 | 46 | 0.1% |
| 3 | 43 | 0.1% |
| 11 | 25 | 0.1% |
| 5 | 11 | < 0.1% |
| 4 | 10 | < 0.1% |
| 13 | 4 | < 0.1% |
| 6 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 35937 | |
| 1 | 198 | 0.5% |
| 2 | 46 | 0.1% |
| 3 | 43 | 0.1% |
| 4 | 10 | < 0.1% |
| 5 | 11 | < 0.1% |
| 6 | 1 | < 0.1% |
| 11 | 25 | 0.1% |
| 13 | 4 | < 0.1% |
| Value | Count | Frequency (%) |
| 13 | 4 | < 0.1% |
| 11 | 25 | 0.1% |
| 6 | 1 | < 0.1% |
| 5 | 11 | < 0.1% |
| 4 | 10 | < 0.1% |
| 3 | 43 | 0.1% |
| 2 | 46 | 0.1% |
| 1 | 198 | 0.5% |
| 0 | 35937 |
no_of_previous_bookings_not_canceled
Real number (ℝ)
High correlation  Zeros 
| Distinct | 59 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.15341144 |
| Minimum | 0 |
|---|---|
| Maximum | 58 |
| Zeros | 35463 |
| Zeros (%) | 97.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 283.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 58 |
| Range | 58 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 1.7541707 |
|---|---|
| Coefficient of variation (CV) | 11.434419 |
| Kurtosis | 457.38009 |
| Mean | 0.15341144 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 19.250191 |
| Sum | 5565 |
| Variance | 3.0771149 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 35463 | |
| 1 | 228 | 0.6% |
| 2 | 112 | 0.3% |
| 3 | 80 | 0.2% |
| 4 | 65 | 0.2% |
| 5 | 60 | 0.2% |
| 6 | 36 | 0.1% |
| 7 | 24 | 0.1% |
| 8 | 23 | 0.1% |
| 10 | 19 | 0.1% |
| Other values (49) | 165 | 0.5% |
| Value | Count | Frequency (%) |
| 0 | 35463 | |
| 1 | 228 | 0.6% |
| 2 | 112 | 0.3% |
| 3 | 80 | 0.2% |
| 4 | 65 | 0.2% |
| 5 | 60 | 0.2% |
| 6 | 36 | 0.1% |
| 7 | 24 | 0.1% |
| 8 | 23 | 0.1% |
| 9 | 19 | 0.1% |
| Value | Count | Frequency (%) |
| 58 | 1 | |
| 57 | 1 | |
| 56 | 1 | |
| 55 | 1 | |
| 54 | 1 | |
| 53 | 1 | |
| 52 | 1 | |
| 51 | 1 | |
| 50 | 1 | |
| 49 | 1 |
avg_price_per_room
Real number (ℝ)
Zeros 
| Distinct | 3930 |
|---|---|
| Distinct (%) | 10.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 103.42354 |
| Minimum | 0 |
|---|---|
| Maximum | 540 |
| Zeros | 545 |
| Zeros (%) | 1.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 283.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 61 |
| Q1 | 80.3 |
| median | 99.45 |
| Q3 | 120 |
| 95-th percentile | 165 |
| Maximum | 540 |
| Range | 540 |
| Interquartile range (IQR) | 39.7 |
Descriptive statistics
| Standard deviation | 35.089424 |
|---|---|
| Coefficient of variation (CV) | 0.33927889 |
| Kurtosis | 3.154125 |
| Mean | 103.42354 |
| Median Absolute Deviation (MAD) | 20.25 |
| Skewness | 0.66713287 |
| Sum | 3751688.9 |
| Variance | 1231.2677 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 65 | 848 | 2.3% |
| 75 | 826 | 2.3% |
| 90 | 703 | 1.9% |
| 95 | 669 | 1.8% |
| 115 | 662 | 1.8% |
| 120 | 612 | 1.7% |
| 100 | 604 | 1.7% |
| 110 | 560 | 1.5% |
| 0 | 545 | 1.5% |
| 85 | 506 | 1.4% |
| Other values (3920) | 29740 |
| Value | Count | Frequency (%) |
| 0 | 545 | |
| 0.5 | 1 | < 0.1% |
| 1 | 9 | < 0.1% |
| 1.48 | 1 | < 0.1% |
| 1.6 | 1 | < 0.1% |
| 2 | 6 | < 0.1% |
| 3 | 3 | < 0.1% |
| 4.5 | 1 | < 0.1% |
| 6 | 25 | 0.1% |
| 6.5 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 540 | 1 | < 0.1% |
| 375.5 | 1 | < 0.1% |
| 365 | 1 | < 0.1% |
| 349.63 | 1 | < 0.1% |
| 332.57 | 1 | < 0.1% |
| 316 | 1 | < 0.1% |
| 314.1 | 1 | < 0.1% |
| 306 | 2 | < 0.1% |
| 300 | 5 | |
| 299.33 | 1 | < 0.1% |
no_of_special_requests
Real number (ℝ)
Zeros 
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.61965541 |
| Minimum | 0 |
|---|---|
| Maximum | 5 |
| Zeros | 19777 |
| Zeros (%) | 54.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 283.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 1 |
| 95-th percentile | 2 |
| Maximum | 5 |
| Range | 5 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.7862359 |
|---|---|
| Coefficient of variation (CV) | 1.2688276 |
| Kurtosis | 0.88143702 |
| Mean | 0.61965541 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.1450808 |
| Sum | 22478 |
| Variance | 0.61816689 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 19777 | |
| 1 | 11373 | |
| 2 | 4364 | 12.0% |
| 3 | 675 | 1.9% |
| 4 | 78 | 0.2% |
| 5 | 8 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 19777 | |
| 1 | 11373 | |
| 2 | 4364 | 12.0% |
| 3 | 675 | 1.9% |
| 4 | 78 | 0.2% |
| 5 | 8 | < 0.1% |
| Value | Count | Frequency (%) |
| 5 | 8 | < 0.1% |
| 4 | 78 | 0.2% |
| 3 | 675 | 1.9% |
| 2 | 4364 | 12.0% |
| 1 | 11373 | |
| 0 | 19777 |
booking_status
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 283.5 KiB |
| Not_Canceled | |
|---|---|
| Canceled |
Length
| Max length | 12 |
|---|---|
| Median length | 12 |
| Mean length | 10.689456 |
| Min length | 8 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Not_Canceled |
|---|---|
| 2nd row | Not_Canceled |
| 3rd row | Canceled |
| 4th row | Canceled |
| 5th row | Canceled |
Common Values
| Value | Count | Frequency (%) |
| Not_Canceled | 24390 | |
| Canceled | 11885 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| not_canceled | 24390 | |
| canceled | 11885 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 72550 | |
| C | 36275 | |
| a | 36275 | |
| n | 36275 | |
| c | 36275 | |
| l | 36275 | |
| d | 36275 | |
| N | 24390 | 6.3% |
| o | 24390 | 6.3% |
| t | 24390 | 6.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 387760 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 72550 | |
| C | 36275 | |
| a | 36275 | |
| n | 36275 | |
| c | 36275 | |
| l | 36275 | |
| d | 36275 | |
| N | 24390 | 6.3% |
| o | 24390 | 6.3% |
| t | 24390 | 6.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 387760 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 72550 | |
| C | 36275 | |
| a | 36275 | |
| n | 36275 | |
| c | 36275 | |
| l | 36275 | |
| d | 36275 | |
| N | 24390 | 6.3% |
| o | 24390 | 6.3% |
| t | 24390 | 6.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 387760 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 72550 | |
| C | 36275 | |
| a | 36275 | |
| n | 36275 | |
| c | 36275 | |
| l | 36275 | |
| d | 36275 | |
| N | 24390 | 6.3% |
| o | 24390 | 6.3% |
| t | 24390 | 6.3% |
Interactions
Correlations
| arrival_date | arrival_month | arrival_year | avg_price_per_room | booking_status | lead_time | market_segment_type | no_of_adults | no_of_children | no_of_previous_bookings_not_canceled | no_of_previous_cancellations | no_of_special_requests | no_of_week_nights | no_of_weekend_nights | repeated_guest | required_car_parking_space | room_type_reserved | type_of_meal_plan | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| arrival_date | 1.000 | -0.043 | 0.086 | 0.007 | 0.035 | 0.000 | 0.047 | 0.036 | 0.029 | -0.006 | -0.018 | 0.020 | -0.010 | 0.029 | 0.033 | 0.007 | 0.025 | 0.073 |
| arrival_month | -0.043 | 1.000 | 0.395 | 0.016 | 0.172 | 0.081 | 0.104 | 0.095 | -0.009 | -0.003 | 0.011 | 0.090 | 0.045 | -0.010 | 0.075 | 0.068 | 0.045 | 0.099 |
| arrival_year | 0.086 | 0.395 | 1.000 | 0.172 | 0.179 | 0.147 | 0.189 | 0.102 | 0.028 | 0.021 | 0.022 | 0.095 | 0.030 | 0.072 | 0.017 | 0.015 | 0.113 | 0.196 |
| avg_price_per_room | 0.007 | 0.016 | 0.172 | 1.000 | 0.165 | -0.021 | 0.317 | 0.161 | 0.244 | -0.178 | -0.103 | 0.198 | 0.018 | -0.026 | 0.161 | 0.064 | 0.278 | 0.104 |
| booking_status | 0.035 | 0.172 | 0.179 | 0.165 | 1.000 | 0.438 | 0.149 | 0.096 | 0.037 | 0.057 | 0.043 | 0.258 | 0.106 | 0.077 | 0.107 | 0.086 | 0.038 | 0.087 |
| lead_time | 0.000 | 0.081 | 0.147 | -0.021 | 0.438 | 1.000 | 0.176 | 0.098 | -0.026 | -0.191 | -0.101 | -0.081 | 0.245 | 0.099 | 0.164 | 0.069 | 0.067 | 0.173 |
| market_segment_type | 0.047 | 0.104 | 0.189 | 0.317 | 0.149 | 0.176 | 1.000 | 0.199 | 0.062 | 0.156 | 0.106 | 0.208 | 0.115 | 0.119 | 0.469 | 0.126 | 0.165 | 0.229 |
| no_of_adults | 0.036 | 0.095 | 0.102 | 0.161 | 0.096 | 0.098 | 0.199 | 1.000 | 0.181 | 0.069 | 0.043 | 0.112 | 0.075 | 0.068 | 0.224 | 0.018 | 0.329 | 0.090 |
| no_of_children | 0.029 | -0.009 | 0.028 | 0.244 | 0.037 | -0.026 | 0.062 | 0.181 | 1.000 | -0.034 | -0.026 | 0.135 | 0.019 | 0.031 | 0.025 | 0.032 | 0.406 | 0.037 |
| no_of_previous_bookings_not_canceled | -0.006 | -0.003 | 0.021 | -0.178 | 0.057 | -0.191 | 0.156 | 0.069 | -0.034 | 1.000 | 0.417 | 0.001 | -0.123 | -0.066 | 0.531 | 0.067 | 0.034 | 0.020 |
| no_of_previous_cancellations | -0.018 | 0.011 | 0.022 | -0.103 | 0.043 | -0.101 | 0.106 | 0.043 | -0.026 | 0.417 | 1.000 | -0.024 | -0.045 | -0.032 | 0.384 | 0.033 | 0.038 | 0.015 |
| no_of_special_requests | 0.020 | 0.090 | 0.095 | 0.198 | 0.258 | -0.081 | 0.208 | 0.112 | 0.135 | 0.001 | -0.024 | 1.000 | 0.045 | 0.066 | 0.039 | 0.095 | 0.075 | 0.070 |
| no_of_week_nights | -0.010 | 0.045 | 0.030 | 0.018 | 0.106 | 0.245 | 0.115 | 0.075 | 0.019 | -0.123 | -0.045 | 0.045 | 1.000 | 0.018 | 0.122 | 0.058 | 0.045 | 0.065 |
| no_of_weekend_nights | 0.029 | -0.010 | 0.072 | -0.026 | 0.077 | 0.099 | 0.119 | 0.068 | 0.031 | -0.066 | -0.032 | 0.066 | 0.018 | 1.000 | 0.067 | 0.029 | 0.030 | 0.045 |
| repeated_guest | 0.033 | 0.075 | 0.017 | 0.161 | 0.107 | 0.164 | 0.469 | 0.224 | 0.025 | 0.531 | 0.384 | 0.039 | 0.122 | 0.067 | 1.000 | 0.110 | 0.067 | 0.074 |
| required_car_parking_space | 0.007 | 0.068 | 0.015 | 0.064 | 0.086 | 0.069 | 0.126 | 0.018 | 0.032 | 0.067 | 0.033 | 0.095 | 0.058 | 0.029 | 0.110 | 1.000 | 0.045 | 0.034 |
| room_type_reserved | 0.025 | 0.045 | 0.113 | 0.278 | 0.038 | 0.067 | 0.165 | 0.329 | 0.406 | 0.034 | 0.038 | 0.075 | 0.045 | 0.030 | 0.067 | 0.045 | 1.000 | 0.146 |
| type_of_meal_plan | 0.073 | 0.099 | 0.196 | 0.104 | 0.087 | 0.173 | 0.229 | 0.090 | 0.037 | 0.020 | 0.015 | 0.070 | 0.065 | 0.045 | 0.074 | 0.034 | 0.146 | 1.000 |
Missing values
Sample
| no_of_adults | no_of_children | no_of_weekend_nights | no_of_week_nights | type_of_meal_plan | required_car_parking_space | room_type_reserved | lead_time | arrival_year | arrival_month | arrival_date | market_segment_type | repeated_guest | no_of_previous_cancellations | no_of_previous_bookings_not_canceled | avg_price_per_room | no_of_special_requests | booking_status | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 2 | 0 | 1 | 2 | Meal Plan 1 | 0 | Room_Type 1 | 224 | 2017 | 10 | 2 | Offline | 0 | 0 | 0 | 65.00 | 0 | Not_Canceled |
| 1 | 2 | 0 | 2 | 3 | Not Selected | 0 | Room_Type 1 | 5 | 2018 | 11 | 6 | Online | 0 | 0 | 0 | 106.68 | 1 | Not_Canceled |
| 2 | 1 | 0 | 2 | 1 | Meal Plan 1 | 0 | Room_Type 1 | 1 | 2018 | 2 | 28 | Online | 0 | 0 | 0 | 60.00 | 0 | Canceled |
| 3 | 2 | 0 | 0 | 2 | Meal Plan 1 | 0 | Room_Type 1 | 211 | 2018 | 5 | 20 | Online | 0 | 0 | 0 | 100.00 | 0 | Canceled |
| 4 | 2 | 0 | 1 | 1 | Not Selected | 0 | Room_Type 1 | 48 | 2018 | 4 | 11 | Online | 0 | 0 | 0 | 94.50 | 0 | Canceled |
| 5 | 2 | 0 | 0 | 2 | Meal Plan 2 | 0 | Room_Type 1 | 346 | 2018 | 9 | 13 | Online | 0 | 0 | 0 | 115.00 | 1 | Canceled |
| 6 | 2 | 0 | 1 | 3 | Meal Plan 1 | 0 | Room_Type 1 | 34 | 2017 | 10 | 15 | Online | 0 | 0 | 0 | 107.55 | 1 | Not_Canceled |
| 7 | 2 | 0 | 1 | 3 | Meal Plan 1 | 0 | Room_Type 4 | 83 | 2018 | 12 | 26 | Online | 0 | 0 | 0 | 105.61 | 1 | Not_Canceled |
| 8 | 3 | 0 | 0 | 4 | Meal Plan 1 | 0 | Room_Type 1 | 121 | 2018 | 7 | 6 | Offline | 0 | 0 | 0 | 96.90 | 1 | Not_Canceled |
| 9 | 2 | 0 | 0 | 5 | Meal Plan 1 | 0 | Room_Type 4 | 44 | 2018 | 10 | 18 | Online | 0 | 0 | 0 | 133.44 | 3 | Not_Canceled |
| no_of_adults | no_of_children | no_of_weekend_nights | no_of_week_nights | type_of_meal_plan | required_car_parking_space | room_type_reserved | lead_time | arrival_year | arrival_month | arrival_date | market_segment_type | repeated_guest | no_of_previous_cancellations | no_of_previous_bookings_not_canceled | avg_price_per_room | no_of_special_requests | booking_status | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 36265 | 2 | 0 | 1 | 3 | Meal Plan 1 | 0 | Room_Type 1 | 15 | 2018 | 5 | 30 | Online | 0 | 0 | 0 | 100.73 | 0 | Not_Canceled |
| 36266 | 2 | 0 | 2 | 2 | Meal Plan 1 | 0 | Room_Type 2 | 8 | 2018 | 3 | 4 | Online | 0 | 0 | 0 | 85.96 | 1 | Canceled |
| 36267 | 2 | 0 | 1 | 0 | Not Selected | 0 | Room_Type 1 | 49 | 2018 | 7 | 11 | Online | 0 | 0 | 0 | 93.15 | 0 | Canceled |
| 36268 | 1 | 0 | 0 | 3 | Meal Plan 1 | 0 | Room_Type 1 | 166 | 2018 | 11 | 1 | Offline | 0 | 0 | 0 | 110.00 | 0 | Canceled |
| 36269 | 2 | 2 | 0 | 1 | Meal Plan 1 | 0 | Room_Type 6 | 0 | 2018 | 10 | 6 | Online | 0 | 0 | 0 | 216.00 | 0 | Canceled |
| 36270 | 3 | 0 | 2 | 6 | Meal Plan 1 | 0 | Room_Type 4 | 85 | 2018 | 8 | 3 | Online | 0 | 0 | 0 | 167.80 | 1 | Not_Canceled |
| 36271 | 2 | 0 | 1 | 3 | Meal Plan 1 | 0 | Room_Type 1 | 228 | 2018 | 10 | 17 | Online | 0 | 0 | 0 | 90.95 | 2 | Canceled |
| 36272 | 2 | 0 | 2 | 6 | Meal Plan 1 | 0 | Room_Type 1 | 148 | 2018 | 7 | 1 | Online | 0 | 0 | 0 | 98.39 | 2 | Not_Canceled |
| 36273 | 2 | 0 | 0 | 3 | Not Selected | 0 | Room_Type 1 | 63 | 2018 | 4 | 21 | Online | 0 | 0 | 0 | 94.50 | 0 | Canceled |
| 36274 | 2 | 0 | 1 | 2 | Meal Plan 1 | 0 | Room_Type 1 | 207 | 2018 | 12 | 30 | Offline | 0 | 0 | 0 | 161.67 | 0 | Not_Canceled |
Duplicate rows
Most frequently occurring
| no_of_adults | no_of_children | no_of_weekend_nights | no_of_week_nights | type_of_meal_plan | required_car_parking_space | room_type_reserved | lead_time | arrival_year | arrival_month | arrival_date | market_segment_type | repeated_guest | no_of_previous_cancellations | no_of_previous_bookings_not_canceled | avg_price_per_room | no_of_special_requests | booking_status | # duplicates | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 337 | 1 | 0 | 0 | 2 | Meal Plan 1 | 0 | Room_Type 1 | 192 | 2018 | 6 | 24 | Offline | 0 | 0 | 0 | 95.0 | 0 | Not_Canceled | 91 |
| 419 | 1 | 0 | 0 | 3 | Meal Plan 1 | 0 | Room_Type 1 | 71 | 2018 | 6 | 14 | Offline | 0 | 0 | 0 | 120.0 | 0 | Not_Canceled | 89 |
| 329 | 1 | 0 | 0 | 2 | Meal Plan 1 | 0 | Room_Type 1 | 164 | 2017 | 10 | 2 | Offline | 0 | 0 | 0 | 100.0 | 0 | Not_Canceled | 87 |
| 1360 | 2 | 0 | 0 | 3 | Meal Plan 1 | 0 | Room_Type 1 | 37 | 2018 | 10 | 13 | Offline | 0 | 0 | 0 | 105.0 | 0 | Not_Canceled | 84 |
| 335 | 1 | 0 | 0 | 2 | Meal Plan 1 | 0 | Room_Type 1 | 188 | 2018 | 6 | 15 | Offline | 0 | 0 | 0 | 130.0 | 0 | Canceled | 83 |
| 1230 | 2 | 0 | 0 | 2 | Meal Plan 2 | 0 | Room_Type 1 | 39 | 2017 | 8 | 14 | Offline | 0 | 0 | 0 | 101.5 | 0 | Not_Canceled | 71 |
| 2066 | 2 | 0 | 1 | 2 | Meal Plan 1 | 0 | Room_Type 1 | 305 | 2018 | 11 | 4 | Offline | 0 | 0 | 0 | 89.0 | 0 | Canceled | 71 |
| 1484 | 2 | 0 | 0 | 3 | Meal Plan 1 | 0 | Room_Type 1 | 304 | 2018 | 11 | 3 | Offline | 0 | 0 | 0 | 89.0 | 0 | Canceled | 68 |
| 436 | 1 | 0 | 0 | 3 | Meal Plan 1 | 0 | Room_Type 1 | 166 | 2018 | 11 | 1 | Offline | 0 | 0 | 0 | 110.0 | 0 | Canceled | 66 |
| 886 | 2 | 0 | 0 | 1 | Meal Plan 1 | 0 | Room_Type 1 | 56 | 2018 | 6 | 8 | Offline | 0 | 0 | 0 | 120.0 | 0 | Not_Canceled | 60 |